A Large-scale Batch-learning Self-organizing Map for Function Prediction of Poorly-characterized Proteins Progressively Accumulating in Sequence Databases : Annual Report of the Earth Simulator Center April 2007 - March 2008
Authors
Abstract
As a result of the decoding of extensive genome sequences, a large number of proteins whose function cannot be predicted by homology searches of amino acid sequences are progressively accumulating in databases and thus remain of no use to science and industry. A method for predicting protein function that does not depend on sequence homology searches is urgently needed. We previously developed a Batch-Learning SOM (BLSOM) for genome informatics, and in the present report we describe the use of the BLSOM method for predicting protein function on the basis of similarity in oligopeptide composition (di-, tri-, and tetrapeptides in this study). Oligopeptides are elementary components of a protein and are involved in the formation of functional motifs and the structural organization of proteins. BLSOM for oligopeptides could extract characteristics of oligopeptide composition that underlie protein structure and function, and could thus predict functions.
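To make "oligopeptide composition" concrete, the following is a minimal sketch of how a protein sequence might be turned into a fixed-length frequency vector of the kind a SOM takes as input. It is not the authors' implementation: the function name `oligopeptide_composition`, the use of overlapping windows, the restriction to the 20 standard residues, and the example sequence are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

# Assumption: only the 20 standard amino acids are counted.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def oligopeptide_composition(sequence, k=2):
    """Return relative frequencies of all 20**k possible k-peptides, in a fixed order.

    Hypothetical helper: overlapping windows of length k are counted, and
    windows containing non-standard symbols (e.g. X, B, Z) are skipped.
    """
    sequence = sequence.upper()
    windows = [sequence[i:i + k] for i in range(len(sequence) - k + 1)]
    valid = [w for w in windows if all(c in AMINO_ACIDS for c in w)]
    counts = Counter(valid)
    keys = ["".join(p) for p in product(AMINO_ACIDS, repeat=k)]
    total = len(valid)
    if total == 0:
        # Sequence too short or entirely non-standard: return an all-zero vector.
        return [0.0] * len(keys)
    return [counts[key] / total for key in keys]


# Made-up example sequence; k=2 gives a dipeptide composition vector.
vector = oligopeptide_composition("MKVLAAGIVGLLLA", k=2)
print(len(vector))
```

With `k=2` the vector has 400 dimensions, and with `k=3` it has 8000. Proteins with similar vectors would be expected to fall close together on a map trained on such input.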
Similar resources
A Large-scale Batch-learning Self-organizing Map for Function Prediction of Poorly-characterized Proteins Progressively Accumulating in Sequence Databases
Homology searches for nucleotide and amino-acid sequences have been used widely to predict functions of genes and proteins when genomes are decoded and thus become a basic bioinformatics tool. Whereas usefulness of the sequence homology search is apparent, it has become increasingly clear that homology search can predict the protein function of only 50% of genes, or fewer, when a novel genome i...
The Time Adaptive Self Organizing Map for Distribution Estimation
The feature map represented by the set of weight vectors of the basic SOM (Self-Organizing Map) provides a good approximation to the input space from which the sample vectors come. But the time-decreasing learning rate and neighborhood function of the basic SOM algorithm reduce its capability to adapt weights for a varied environment. In dealing with non-stationary input distributions and changi...
Development of General Purpose Numerical Software Infrastructure for Large Scale Scientific Computing : Annual Report of the Earth Simulator Center April 2007 - March 2008
The Development of Software Infrastructure for Large Scale Scientific Simulation project, or the Scalable Software Infrastructure (SSI) project for short, was initiated in November 2002, for the purpose of constructing a scalable software infrastructure to expand large scale computing environments to replace existing implementations of parallel algorithms and implementations in individual scien...
Large Scale MD Simulations of Proteins on the Earth Simulator: Quaternary Structural Changes of Hemoglobin : Annual Report of the Earth Simulator Center April 2007 - March 2008
The purpose of our group is to computationally demonstrate large structural changes of hemoglobin using COSMOS90, which was accelerated on the Earth Simulator by vectorization and parallelization of all subroutines. COSMOS90 can efficiently simulate proteins under realistic conditions, i.e., in water with all degrees of freedom and long-range Coulomb interactions. Hemoglobin consists of four sm...
Self-Burning: a Common and Tragic Way of Suicide in Fars Province, Iran
Self-burning is the most devastating burn injury. It is a common social and medical problem in Iran. In a longitudinal prospective study, from April 2003 to March 2006, all burn patients admitted to Ghotb-eddin burn Hospital were enrolled in this study. Suicide attempts by burning accounted for 283 (21.9%) of all burn patients admitted to the hospital. Most (68.2%) of self-burning patients were...
Journal title:
Volume, issue
Pages -
Publication date: 2008